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Abstract: The Middle East respiratory syndrome coronavirus (MERS-CoV) is a virus that manifests itself in 
viral infection with fever, cough, shortness of breath, renal failure and severe acute pneumonia, which often result 
in a fatal outcome. MERS-CoV has been shown to spread between people who are in close contact. ‘Transmission 
from infected patients to healthcare personnel has also been observed and is irredeemable with present technology. 
Genetic studies on MERS-CoV have shown that ORF lab encodes replicase polyproteins and play a foremost role 
in viral infection. Therefore, ORF lab replicase polyprotein may be used as suitable target for disease control. 
Viral activity can be controlled by RNA interference (RNAi) technology, a leading method for post transcriptional 
gene silencing in a sequence specific manner. However, there is a genetic inconsistency in different viral isolates; 
it is a great challenge to design potential RNAi (miRNA and siRNA) molecules which can silence the respective 
target genes rather than any other viral gene simultaneously. In current study four effective miRNA and five 
siRNA molecules for silencing of nine different strains of MERS-CoV were rationally designed and corroborated 
using computational methods, which might lead to knockdown the activity of virus. siRNA and miRNA molecules 
were predicted against ORF lab gene of different strains of MERS-CoV as effective candidate using computational 
methods. Thus, this method may provide an insight for the chemical synthesis of antiviral RNA molecule for the 


treatment of MERS-CoV, at genomic level. 


Key words: MERS-CoV, RNAi, antiviral, gene silencing, computational method. 


1 Introduction 


Coronaviruses are enveloped single-stranded positive- 
sense RNA viruses with genomes of 25 to 32 kb, and the 
group includes the biggest known genomes among the 
RNA viruses (Wertheim et al., 2013; Orbalenya et al., 
2006), which infects and causes disease in a wide variety 
of species, including bats, birds, cats, dogs, pigs, mice, 
horses, whales and humans. Five types of coronaviruses 
are found to be circulating the human population (De 
Groot et al., 2013). A second coronavirus known as the 
Middle East respiratory syndrome coronavirus (MERS- 
CoV) has been recognized as the causal organism of a 
highly lethal pneumonia in patients in the Middle East 
and in travelers from this region about ten years af- 
ter the severe acute respiratory syndrome became epi- 
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demic (Memish et al., 2013). The virus MERS-CoV be- 
longs to the genus betacoronavirus (Zaki et al., 2012) 
which is also known as novel coronavirus (nCoV). Af 
ter the first report of causing human infection was pub- 
lished in September 2012, the World Health Organi- 
zation (WHO) International Health Regulations Emer- 
gency Committee denied the fact that MERS-CoV did 
not meet the criteria of becoming “public health emer- 
gency of global concern” in July 2013. Nevertheless it 
was termed “serious and great concern” and was also 
reported as a threat to the entire world. In June, 2012, 
for the first time virus was identified in a patient who 
died in Saudi Arab after developing acute respiratory 
distress syndrome (ARDS) and multiple organ dysfunc- 
tional syndrome (MODS) (Hui et al., 2014). In same 
year the first MERS disease broke out has been reported 
(Guery et al., 2013). Although the name suggests that 
it is limited to the Middle East, the death scinerio have 


been also reported in France, UK, Italy, Tunisia (Assiri 
et al., 2013). The symptoms include fever, cough, and 
shortness of breath. Although, MERS-CoV patients 
have primarily had respiratory disease, but a number 
of secondary complications have also been informed, in- 
cluding acute renal failure, multi-organ failure, acute 
respiratory distress syndrome (ARDS), and consump- 
tive coagulopathy. In addition, many patients have also 
reported gastrointestinal symptoms, including diarrhea 
(Guery et al., 2013). As mentioned earlier, this virus 
has the capacity to transmit from human to human. 
It was also shown that MERS-CoV was capable of in- 
fecting bats and porcine cell lines in addition to human 
cells. ‘These possessions would indicate a low barrier for 
transmission between hosts (Miiller et al., 2012; Eckerle 
et al., 2014; Annan et al., 2013). Recently, Dipeptydal 
peptidase 4 (DPP4) of primary human bronchial ep- 
ithelial cells and huh-7 cells are identified as the recep- 
tor of MERS-CoV virus entry. After entering into the 
cell the virus specially modulates the innate immune re- 
sponse, antigen presentation, mitogen-activated protein 
kinase. The genome structure of MERS-CoV is simi- 
lar to other coronaviruses, with the 5’ two-thirds of the 
genome encoding the non-structural proteins (NSPs) 
required for viral genome replication, the remaining 3’ 
third of the genome encoding the structural genes that 
make up the virion (spike, envelope, membrane, and 
nucleocapsid proteins), and four accessory genes inter- 
mingled within the structural gene region. The viral 
genome expression starts with the translation of two 
large replicative polyproteins, ppla and pplb, which 
are encoded by the viral replicase gene that compro- 
mises ORF la and 1b. This replicase complex not only 
mediates genome replication but also causes transcrip- 
tion of nested set of subgenomic mRNA-responsible for 
structural proteins (Van et al., 2012; Eric et al., 2000). 
After transcribing ppla, —1 ribosomal frame shifting 
is occurred to transcribe pplb which is subsequently 
cleaved into 15 or 16 nonstructural proteins (Eric et al., 
2000). The downstream region of ORF 1b is character- 
ized by containing a variable number of smaller genes, 
always including those encoding the spike (S), envelope 
(E), membrane (M), and nucleocapsid (N) structural 
proteins. ‘These genes are translated from subgenomic 
(sg) mRNAs that form a 5’ and 3’co-terminal nested 
set with the viral genome. Subgenomic mRNAs are 
composed of a common 5’ leader sequence that is iden- 
tical to the genomic 5’ region and a variable part of 
the 3’ quarter of the genome, with different ss mRNAs 
making different ORFs available for translation. The 
complement of the leader and “body” segments of the 
se mRNAs are assumed to be joined during discontin- 
uous negative-strand RNA synthesis. This step pro- 
duces the templates for sg mRNA synthesis and is di- 
rected by a base pairing interaction between conserved 
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transcription-regulatory sequences (TRSs) (Pasternak 
et al., 2006; Sawicki et al., 2007; Sola et al., 2011). Such 
TRSs are found at the 3’end of the leader sequence 
(leader TRS) and at different positions upstream of 
genes in the genomic 3’proximal domain (body TRSs). 
The synthesis of subgenome-length negative-stranded 
RNAs is directed by the complement of a body TRS 
at the 3’end of a nascent minus-strand base pairing 
with the leader TRS, with the extent of sequence com- 
plementarity being an important determinant of the 
level at which a given sg mRNA is produced (Van et 
al., 2012). In this study two types of gene silencing 
techniques were implemented computationally, siRNA 
and miRNA. siRNA also called silencing RNA is gener- 
ally double stranded RNA and contain 21 base pairs 
in length. ‘The central role of siRNA is post tran- 
scriptional gene silencing by interfering with the pro- 
tein and their basic mechanism is to bind and promote 
the degradation of messenger RNA (mRNA) at specific 
sequences (Taxman et al., 2006). An effective siRNA 
has to contain some basic features like A/U at the 5’ 
end of the antisense strand, G/C at the 5’ end of the 
sense strand, at least 5 A/U residues in the 5’ termi- 
nal end or one third of the antisense strand and ab- 
sence of any GC stretch of more than 9 nucleotide in 
length. If designed siRNA fails to fulfill above crite- 
ria then gene silencing is not possible by that partic- 
ular siRNA (Ui-Tei et al., 2008). Each siRNA strand 
contains a 5’ phosphate group and a 3’ hydroxyl (OH) 
eroup, produced from dsRNA or hairpin looped RNA. 
By using nanoparticles like polyethyleneimine and other 
approaches, siRNA can be introduced into the cell for 
specific knockdown of a gene of interest. After entering 
into the cell siRNA cleaved by an enzyme called dicer 
(RNase ITI-like enzyme). Then the siRNA incorporated 
with multi subunit protein complex called RNAi in- 
duced silencing complex (RISC). RISC seeks out the 
correct target mRNA, then siRNA unwinds and is be- 
lieved. After this process the antisense strand directs 
degradation of the complementary strand of appropri- 
ate mRNA using different kinds of endo and exogenous 
enzymes (Jackson et al., 2010). siRNA may also be used 
for therapeutic purpose like vaccine and their efficacy 
can be ensured to more than 75% to knockout or kill 
any targeted gene or protein (Chan et al., 2009). Ma- 
ture miRNA regulate diverse cellular processes in plant 
and animal species through the assembly of miRNA- 
induced RISC complex which binds to the complemen- 
tary targets in the cellular mRNA and subsequently 
cleaves or transcriptionally repress the mRNA depend- 
ing on the complementarity. In addition, recent studies 
suggest that miRNA also play a role in the regulation 
of viral infection (Filipowicz et al., 2008). Therefore, in 
the present study an effort has been made to identify 
potential siRNA and miRNA molecules for silencing of 
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ORFlab coding mRNA or gene in MERS-CoV using 
computational approach. 


2 Materials and Methods 


2.1 Sequence retrieval and analysis 


Nine genes of ORFlab of nine different strains 
of Middle East Respiratory Syndrome Coronavirus 
(MERS-CoV) were collected from the gene bank 
database available at National Centre for Biotechnolog- 
ical Information (http://www.ncbi.nlm.nih.gov/). This 
gene has the crucial role for further down regula- 
tion of other necessary genomic and structural repli- 
cator components. The accession numbers of these 
nine genes are JX869059.2, KF192507.1, KC667074.1, 
KC164505.2, KC776174.1, KF186567.1, KF186566.1, 
KF186565.1 and KF186564.1 for different strains of 
MERS-CoV. 


2.2 Open reading frame (ORF) screening and 
multiple sequence alignment 


With a view to screening out the coding regions 
of these nine different gene sequences of MERS-CoV, 
a web based interactive tool for predicting and an- 
alyzing ORFs from nucleotide sequences was used 
known as ORF Finder (http://www-.ncbi.nlm.nih.gov/ 
projects/gorf/). Afterwards, using CLASTALW 
(http://www.genome.jp/tools/clustalw/), individually 
all the coding regions for nine different strains were an- 
alyzed. The analysis showed 99.99% similarities among 
the coding regions. 


2.3 Potential designing of siRNA and miRNA 


To design a potential siRNA, siDirect 2.0 was 
used (Naito et al., 2009). While using this tool, 
some rules were marked as parameters including 
Ui-Tei, Amarzguioui, Renold rules as well as melting 
temperature which must be below 21.5°C to allow 
potential siRNA duplex. Apart from these other 
factors were also kept in mind on the concept of algo- 
rithms (Table 1). In similar fashion, for the purpose of 
designing a potential miRNA, another web based tool 
called IDT (Integrated DNA Technology), (http://sg. 
idtdna.com/Scitools/Applications/RNAi/RNAi.aspx) 
was utilized. Following this, the RNAi’s were submit- 
ted to RISC Binder for the prediction of functional 
guide strand of miRNA (Ahmed et al., 2009). 


2.4 Similarity search and target alignment 


For checking any off target sequence resemblance 
in other non-targeted organism’s genome, Blast tool 
(http://www.ncbi.nlm.nih.gov/blast) was used against 
whole Genebank database by applying expected thresh- 
olds value 10 and BLOSUM 62 matrix as parameter. 


2.5 GC calculation and secondary structure 
prediction 


DNA/RNA GC content calculator (http://www. 
endmemo.com>Tools>Biology) was used so as to cal- 
culate the GC content of the predicted siRNA. Con- 
sequently, for prediction of secondary structure and 
for computation of the free energy of folding for both 
siRNA and miRNA, mfold server (Zuker et al., 2003) 
was used. 


2.6 Calculation of RNA-RNA 
through Thermodynamics 


interaction 


To study the thermodynamics of interaction between 
predicted siRNA and target gene, RNAcofold program 
(Bernhart et al., 2006) was used. It calculates the hy- 
bridization energy and base-pairing form of two RNA 
sequences. It functions as extension of McCaskill’s par- 
tition function algorithm to compute probabilities of 
base pairing, realistic communication energies and equi- 
librium concentrations of duplex structures. Similar in- 
teraction study was performed for miRNA with RNA 
hybrid program (Rehmsmeier et al., 2006). 


2.7 Calculation of Heat capacity and concen- 
tration plot of duplex 


For both cases (miRNA and siRNA), the heat ca- 
pacity plot and concentration plot has been calculated. 
The ensemble heat capacity (Cp) is plotted as a func- 
tion of temperature, with the melting temperature Tm 
(Cp) indicated. The detailed heat capacity plot also 
shows the contributions of each species to the ensemble 
heat capacity, whereas concentration plot- Tm(Conc), 
the point at which the concentration of double-stranded 
molecules of one-half of its maximum value defines the 
melting temperature Tm(Conc). Both of these were 
performed by DNAMelt web server (Markham ez al., 
2005). 


2.8 Prediction of secondary structure 


The mfold web server contained UNAfold program 
(Zuker et al., 2003) was also used to predict the sec- 
ondary structure formed between mRNA and RNAi 
guide strands. 


3 Results and Discussion 


ORF lab gene is the first line open reading frame 
genes in its whole genome length in MERS-CoV which 
covers two third of length. In addition to its gene, its 
translational products also play active part in differ- 
ent functional aspects such as viral genome regulating 
transcriptase, protease and also encode replicase ma- 
chinery for own replication of virus (Eric et al., 2000). 
So to speak, it can be considered as an immediate early 
gene and silencing of this gene will result in termina- 
tion of its essential function. In accordance with the 


Table 1 
Ui-Tei Rules 


A/U at the 5’ terminus of the sense Strand 


G/C at the 5’ terminus of the anti-Sense strand 
position 6. 


At least 4 A/U residues in the 5’ terminal 7 bp 
of sense strand. 


No GC stretch longer than 9 nt 


present study, nine ORF lab gene sequences from nine 
different strains of MERS-CoV were obtained from gene 
bank of NCBI. All sequences subjected in ORF Finder 
to determine the cds of genes. ClastalW was brought 
into service in order to find out similarities among the 
cds. Results illustrate that they were more or less sim- 
ilar (99.99%). In next step of the study, siDirect, a 
web-based online software system for computing highly 
effective sIRNA was used. siDirect 2.0, an updated 
version of siDirect, which provides functional and off- 
target minimized siRNA design. All cds examined indi- 
vidually for siRNA against themselves through siDirect 
2.0. As Ui-Tei, Amarzguioui and Reynolds rules (U, 
R, A rules) govern siRNA sequence preference and in 
accordance with which highly effective siRNA can be 
readily designed. 469 siRNA suggested by siDirect 2.0 
which were tested for whether they follow URA rules. 
93 siRNA found to follow URA rules. Similarly for 
miRNA against same genes all sequences were posed 
to IDT where 45 RNAi found to meet the standards 
of IDT. As it is important to ensure whether RNAi 
binds RISC, results presented in Table 3 prove the suc- 
cessful incorporation of guide strand with RISC. In 
this study RISC Binder tool used to confirm the in- 
corporation of miRNA guide strand with RISC. Bind- 
ing ability of RNAi with RISC proves their capabil- 
ity of working as miRNA (Ahmed et al., 2009). Only 
four miRNA from forty five persists after the selection. 
ClastalW results of four miRNA revealed that three 
miRNA holds identical sequence based on which two 
eroup was maiden. One group contains identical three 
miRNA and the remaining one subjected to another 
eroup. For further validation of reduced off target, 
all siRNA’s and miRNA’s subjected to BlastN. All the 
siRNA and miRNA holds GC content within 36- 48% 
determined by GC content calculator. GC content of 
sIRNA and miRNA is a determinant of the perfection of 
their business as GC-rich target sites were avoided be- 
cause of their potential to be involved in strong bonding 
in their secondary structure while designing siRNA and 
miRNA (Bret et al., 2005). Lower GC content leads 
to stronger inhibitory effects and 30-57% of GC con- 
tent is considered sufficient for the execution of RNAi’s 
action (Liu et al., 2013). Because function of RNAi 


Amarzguioui Rules 


Duplex end A/U differential > 0 
Strong binding of 5’sense strand 


No U at position 1.Presence of A at 
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Algorithms or rules for rational design of siRNA molecules 


Reynolds Rules 


Each rule is assigned a score which is 
summed up to a total duplex score to 
improve the efficiency of siRNA. 


Weak binding of 3’sense strand. No G 
at position 19 


is determined by molecular structure, there is signifi- 
cant investment in computational methods for predict- 
ing RNA secondary structure (Hajiaghayi et al., 2012) 
and these Computational methods for modeling RNA 
secondary structure have proven to be valuable in many 
cases in which crystal structures are not available (Ding 
et al., 2005). Minimum free energy (MFE) is considerd 
as benchmark of RNA’s molecular structural accurecy 
(Mathews, 2005). Free energy minimization is a long- 
established paradigm in computational structural biol- 
ogy that is based on the assumption that, at equilib- 
rium, the solution to the underlying molecular folding 
problem is unique, and that the molecule folds into the 
lowest energy state (Ding et al., 2005). To measure the 
stability of structure of the guide strand, the minimum 
free energy of folding of was computed using mfold and 
those who possess least minimum free energy was tab- 
ulated (Table 2, Table 3 and Table 4). Therefore, neg- 
ative free energy of folding may have lower accessibility 
for target site. Reliable prediction of RNA-RNA bind- 
ing energies is crucial, e.g. for the understanding on 
RNAi, microRNA-—mRNA binding and antisense inter- 
actions. ‘The thermodynamics of such RNA-RNA in- 
teractions can be understood as the sum of two energy 
contributions: (1) the energy necessary to ‘open’ the 
binding site and (2) the energy gained from hybridiza- 
tion (Muckstein et al 2006). Hence the accurate com- 
putational treatment of RNA-RNA binding therefore 
lies at the heart of target prediction algorithms. The 
RNAcofold web server used to figure the hybridization 
energy and base-pairing pattern of two RNA sequences. 
RNAcofold, therefore, guides the structure prediction 
process by a combination of thermodynamic and ki- 
netic considerations. In addition to A GAB, the free 
energy of the heterodimer of sequence A and sequence 
B can be calculated using the equation AGbinding = A 
GAB- A GA-A GB (Bernhart et al., 2006). So, for the 
interaction of a target mRNA and its predicted siRNA, 
free energy of binding is A Gbinding = A GAB-AGA- 
AGB (Table 2). Free energy of binding for miRNA de- 
termined through RNA hybrid. siRNA and miRNA’s 
having highest energy of binding with target sequence 
shown in ‘Table 2, Table 3 and Table 4. The DINAMelt 
web server predicts entire equilibrium melting profiles 
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Fig. 1 (a)-(e): Secondary Structures of Target- siRNA Duplex 


as a function of temperature with melting temperature 
for a hybridized pair of nucleic acids. The heat capac- 
ity (Cp) as a function of temperature given in Table 2, 
Table 3 and Table 4. Tm (Conc) is the temperature 
at which the temperature of nucleic acid reaches half of 
its maximum value (Markham et al., 2005). It is known 
that the secondary structure can indeed cause inhibi- 
tion of RNAi (Vickers et al., 2000; Bohula et al., 2003; 
Nur et al., 2013). Also the secondary structure of RNAi 
can be involved in the RISC cleavage. That’s why pre- 
dicting secondary structures in order to guide selection 
of sites for RNAi targeting is more important (Bret 
et al., 2005). Secondary structures of RNAi provide 
a convenient and computationally tractable estimation 
not only to structures but also to the thermodynamics 
of RNA-RNA interaction. UNAfold predicts the most 
stable secondary structure of an oligonucleotide by min- 
imizing folding free energy. The folded structures are 
predicted at the specified temperature which is 37°C. 
The secondary structures of siRNA and miRNA are 
shown in Fig. l(a)-(e), Figure 2 (a)-(c) and Fig. 3. Af- 
ter analysis of all siRNA and miRNA, finally five siRNA 
and four miRNA passes all barriers of effectiveness and 
those are worthy to fight against MERS-CoV. RNA 
interference (RNAi) a novel regulatory and powerful 
silencing approach for molecular therapeutics (Bryan 


et al. 2014). Though both siRNA and miRNA have 
novel function for gene silencing but siRNA is compar- 
atively better than miRNA due to its sequence speci- 
ficity with targets and easier insertion techniques (via 
nano-perticle, lipid based transfection etc.) into the 
cell. Imbued by the mechanism and suitability of RNAi, 
scientists and pharmaceuticals are working to use it as 
next generation therapeutics. To treat a wide variety 
of disease RNAi technology is the riding lamp to re- 
searcher. Current discoveries on miRNAs’ involvement 
in cancer suggest that RNA inhibition can be used as 
an advanced mean for treating cancer patients Stahlhut 
et al., 2013). miRNA-based therapeutics is also in con- 
sideration for the treatment of cardiovascular disorders 
(Harada et al., 2014). 


4 Conclusion 


With RNAi technology it is possible to design a num- 
ber of siRNA and miRNA molecules for silencing of 
substantial genes in numerous biological systems. Their 
connections with target can also be calculated compu- 
tationally. Hence, in our study five siRNA and four 
miRNA molecules were predicted against ORF lab gene 
of different strains of MERS-CoV as effective aspirant 
using computational approaches. ‘These molecules may 
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Fig. 2 (a)-(c): Secondary Structures of Target-miRNA Duplex (Group 1) 


industry to advance antiviral therapeutics at genomic 
level. 
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